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Abstract 

In the (classical) Secretary Problem, one has to hire the best among n candidates. The 
candidates are interviewed, one at a time, at a uniformly random order, and one has to decide 
on the spot, whether to hire a candidate or continue interviewing. It is well known that the 
best candidate can be hired with a probability of 1/e (Dynkin, 1963). Recent works extend 
this problem to settings in which multiple candidates can be hired, subject to some constraint. 
Here, one wishes to hire a set of candidates maximizing a given objective set function. 

Almost all extensions considered in the literature assume the objective set function is either 
linear or submodular. Unfortunately, real world functions might not have either of these prop¬ 
erties. Consider, for example, a scenario where one hires researchers for a project. Indeed, it 
can be that some researchers can substitute others for that matter. However, it can also be that 
some combinations of researchers result in synergy (see, e.g., Woolley et al., Science 2010, for a 
research about collective intelligence). The first phenomenon can be modeled by a submoudlar 
set function, while the latter cannot. 

In this work, we study the secretary problem with an arbitrary non-negative monotone valua¬ 
tion function, subject to a general matroid constraint. It is not difficult to prove that, generally, 
only very poor results can be obtained for this class of objective functions. We tackle this 
hardness by combining the following: (1) Parametrizing our algorithms by the supermodular 
degree of the objective function (defined by Feige and Izsak, ITCS 2013), which, roughly speak¬ 
ing, measures the distance of a function from being submodular. (2) Suggesting an (arguably) 
natural model that permits approximation guarantees that are polynomial in the supermodular 
degree (as opposed to the standard model which allows only exponential guarantees). Our algo¬ 
rithms learn the input by running a non-trivial estimation algorithm on a portion of it whose 
size depends on the supermodular degree. 

We also provide better approximation guarantees for the special case of a uniform matroid 
constraint. To the best of our knowledge, our results represent the first algorithms for a secretary 
problem handling arbitrary non-negative monotone valuation functions. 
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1 Introduction 

In the (classical) Secretary Problem, one has to hire a worker from a pool of n candidates. 
The candidates arrive to an interview at a uniformly random order, and the algorithm must de¬ 
cide immediately and irrevocably, after interviewing a candidate, whether to hire him or continue 
interviewing. The objective is to hire the best candidate. It is well-known that the best candidate 
can be hired with a probability of 1/e, and that this is asymptotically optimal [11]. 

Recently, there has been an increased interest in variants of the secretary problem where more 
than a single candidate can be selected, subject to some constraint (e.g., a matroid constraint). Such 
variants have important applications in mechanism design (see, e.g., mmmm and the references 
therein). When more than one candidate can be selected, there is a meaning to the values of 
subsets of candidates. If one allows these values to be determined by an arbitrary non-negative 
monotone set function, then only exponentially competitive ratios (in the number of candidates) 
can be achieved, even subject to a simple cardinality constraint!]] 

In light of the above hardness, previous works have concentrated on restricted families of ob¬ 
jective functions, such as linear and submodular functions (see, e.g., 0 m El |9l E3 EDI m\ and a 
more thorough discussion in Section T3.21) . However, for many applications, the desired set function 
might admit complements, i.e., a group of candidates might exhibit synergy and contribute more 
as a group than the sum of the candidates’ personal contributions (see also Woolley et al. |29] 
for a research about collective intelligence). Such complements cannot be modeled by submodular 
(or linear) objectives. Dealing with complements in general results in unacceptable guarantees, as 
discussed above. However, what if one has a function which is submodular, except for a pair of 
candidates which are better to hire together? Can we guarantee anything for this case? 

In this paper, we give a strong affirmative answer to this question. Specifically, we give al¬ 
gorithms for secretary problems with arbitrary non-negative monotone objective functions, whose 
guarantees are proportional to the distance of the objective function from being submodular, as 
measured by the supermodular degree (defined by flT]). Back to the above example, the pair of 
synergistic candidates results in an objective function with a supermodular degree of 1, and for 
such an objective our algorithms provide a constant competitive ratio for the problem of hiring a 
team of a given size. Intuitively, the supermodular degree can be seen as measuring the number of 
candidates that any single candidate can have synergy with. Our algorithms handle both the case 
of a cardinality constraint (demonstrated above) and the more general case of a matroid constraint. 
For a cardinality constraint, we obtain a constant competitive ratio when the supermodular degree 
is constant. For a (general) matroid constraint, our competitive ratios depend logarithmically on 
the rank of the matroidH To the best of our knowledge, these are the first algorithms for the 
secretary problem with an arbitrary non-negative monotone objective set function. 

2 Preliminaries 

For completeness of the presentation, we give in this section a few relevant definitions from the 
literature (see, e.g., |15|). All set functions in this work are non-negative and (non-decreasing) 
monotonqfl. For readability, given a set S C Af and an element u £ Af we use S + u to denote 
S U {«} and S — u to denote S \ {u}. 

Tntuitively, the bad example consists of a cardinality constraint allowing us to select only k candidates, and an 
objective function assigning a strictly positive value only to sets containing k specific candidates or more than k 
candidates. In this case the algorithm has no room for mistakes, which leads to a very poor performance. 

2 Note that, till a very short while ago, this was the case even for the state of the art algorithm for a submodular 
objective function (which corresponds to a supermodular degree of 0) [2D]. An improved algorithm with a competitive 
ratio of O(loglogfc) (where fc is the rank of the matroid) has been recently given by 1181 . 

3 A set function /: 2A —¥ R + is monotone if and only if f(S) < f(T) whenever S C T C J\f ■ 
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2.1 Matroids 


Given a ground set A f, a pair {N ,1) is called a matroid if X C 2^ obeys three properties: 

(i) X is non-empty. 

(ii) X is hereditary, i.e., S C T C J\f and T € X imply S GX. 

(iii) For every two sets S,T £ X such that |5| > \T\, there exists an element u € S\T, such that 
T + u G X. This property is called the augmentation property of matroids. 

We say that a set S C JV is independent if S E X. The rank of a matroid is the size of the 
largest independent set in X. One important class of matroids, which is central to our work, is the 
class of uniform matroids. A uniform matroid of rank k is simply a cardinality constraint, i.e., a 
set S C AMs independent in such a matroid if and only if its cardinality is at most k. 

2.2 Supermodular degree 

The following standard definition is very handy. 

Definition 2.1 (Marginal set function). Let f : 2^ —>• M + be a set function and let u € A f. 
The marginal set function of f with respect to u, denoted by f(u | •) is defined as f(u \ S) = 
f(S + u) — f(S). When the underlying set function f is clear from the context, we sometimes call 
f(u | S ) the marginal contribution of u to the set S. Similarly, for subsets S,T C M, we use the 
notation f(T \ S) = f(S U T) — f(S). 

We can now give the definition of the supermodular degree (originally defined by |14j). which 
is used to parameterize our results. 

Definition 2.2 (Supermodular (dependency) degree). The supermodular degree of an element u € 
N with respect to f is defined as the cardinality of the set V^{u) = {u € A^ | ^scAff{u \ S + v) > 
f(u | S)}, containing all elements whose existence in a set might increase the marginal contribution 
of u. V~f(u) is called the supermodular dependency set of u with respect to f, and we sometimes 
refer to the elements ofV^ ( u ) as supermodular dependencies. The supermodular degree of a function 
f, denoted by T>J, is simply the maximum supermodular degree of any element u € A f. Formally, 
VJ = max ue y \V~^(u)\. When the underlying set function is clear from the context, we sometimes 
omit it from the notations. 

Note that 0 < < n — 1 for any set function /. More specifically, T>~jf = 0 when / is 

submodular , and becomes larger as / deviates from submodularity. When the function / is clear 
from the context, we use d to denote T>J. 

2.3 Input representation 

In general, a set function might assign 2 n different values for the subsets of a ground set of size n. 
Thus, not every set function has a succinct (i.e., polynomial in n) representation. Therefore, it is a 
common practice to assume access to a set function via an oracle. That is, an algorithm handling 
a set function often gets access to an oracle that answers queries about the function, instead of 
getting an explicit representation of the function. Arguably, the most basic type of an oracle is the 
value oracle, which, given any subset of the ground set, returns the value assigned to it by the set 
function. Formally: 

Definition 2.3. Value oracle of a set function f : 2^ —>• R + is the following: 

Input: A subset S C J\f . 

Output: f(S). 
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Similarly, since in a given matroid the number of independent subsets might be, in general, 
exponential in the size of the ground set, it is common to assume access to the following type of 
oracle. 

Definition 2.4. Independence oracle of a matroid (Af, X) is the following: 

Input: A subset S C Af. 

Output: A Boolean value indicating whether S € T. 

Additionally, in order to manipulate a function with respect to the supermodular degree, one 
needs a way to determine the supermodular dependencies of a given element of the ground set. An 
oracle for that purpose was introduced by [H], and was later used also by na Formally: 

Definition 2.5. Supermodular oracle of a set function f : 2^ —>• R + is the following: 

Input: An element u € Af. 

Output: The set V + {u) of the supermodular dependencies of u with respect to f. 

The above oracles definitions are acceptable for offline algorithms. In online settings, these 
oracles have to be weakened and limited to return only information that we “expect” the algorithm 
to have. Some possible weakened versions can be found in previous work. Still, finding a set of 
weakened oracles that “makes sense” in the context of the supermodular degree is not trivial. Our 
model, including the weakened oracles that we use, appears in Section [3j 

2.4 Online algorithms 

Like standard online algorithms, the performance of a secretary algorithm is measured by the 
competitive ratio, which is the worst case ratio between the expected performance of the algorithm 
and the performance of an offline optimal algorithm. More formally, let V be the set of possible 
instances, OPT(P) be the value of the optimal solution for an instance P £ V and ALG(P) 
be the value of the algorithm’s solution given the instance P. Then, the competitive ratio (for 
maximization problems) of the algorithm is given by: 

OPT(P) 

PeP E[ALG(P)] ’ 

where the expectation is over the randomness of the algorithm and the arrival order of the input. 

2.5 Techniques 

Most algorithms for secretary problems start with a learning phase in which they reject all elements, 
and later, after accumulating some information about the input, they move to a phase in which they 
may accept elements. When the value of an element might positively depend on other d elements, 
there might be a set of d + 1 elements such that every reasonable solution must contain this set. 
In this case, any reasonably good algorithm must terminate the learning phase, with a significant 
probability, before any element of this set arrives. 

This means that the learning phase of our algorithms consists of only about 1/d of the input 
(where d is the supermodular degree of the objective function), and thus, they rarely see all the 
dependencies of an element in the learning phase. Hence, by the end of the learning phase, our 
algorithms cannot calculate an optimal solution for the sub-problem represented by the part of 
the input seen thus far. However, we show that it is possible to estimate the value of the optimal 
solution based on the learning phase, and this estimation is crucial for the performance of our 
algorithms. 
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3 Model and results 

Consider the following scenario. A client enters a store, and wants to buy herself a new phone. 
However, the client’s main motive to buy this phone is a novel accessory which is not supported 
by her old phone. Thus, the client asks the salesman to buy the phone bundled with the accessory. 
Unfortunately, the accessory is not available at that time, because supply does not meet the over¬ 
whelming demand. If the client insists on buying the phone only bundled with the accessory, then 
the salesman can offer her to buy a phone now and get the accessory next week when a new supply 
shipment arrives. 

On the other hand, consider a slightly different scenario. In this scenario, the client tells the 
salesman that she wants the phone together with some accessory, but does not tell him which 
accessory it is. The client then offers the following deal: the salesman will give her the phone now, 
and the client will pay when the unspecified accessory becomes available. Clearly no salesman can 
accept such an offer. 

Our model (below) assumes the more realistic first scenario. That is, in case of complementarity, 
the “bidder” is required to announce the future elements she needs in order to get the maximum 
value from the current “product”. 

Formally, an instance of the monotone matroid secretary problem consists of a ground set Af 
of size n, a non-negative monotone set function / : 2 ^ —>• R + and a matroid M = The 

execution of an algorithm for this problem consists of n steps (also referred to as times). In each 
step the following occurs: 

• One element of M is revealed (arrives), at a uniformly random order (without repetitions). 

• The algorithm must decide whether to include the element in its output (irrevocably). 

The objective of the algorithm is to select an independent subset maximizing /. When making 
decisions, the algorithm has access to the value of n, the supermodular degree of / and the following 
oracles. The first oracle is the independence oracle given above, which gives information about the 
constraint. The second oracle gives information about the objective function. This oracle is the 
counterpart of the value oracle defined above. 

Definition 3.1. Online marginal oracle of a set function f : 2^ —>• R + is the following: 

Input: An element u € M that has already been revealed and a subset S C J\f. 

Output: f{u | S). 

Note that S does not have to be fully (or even partially) revealed, but u does have to. That 
is, we can only ask marginal queries for elements that have been revealed, but we can ask for their 
marginal value with respect to any subset. Our algorithms use the online marginal oracle only 
for the purpose of finding the best marginal that an element u can have with respect to already 
accepted elements and subsets of V + (u). This use is consistent with the above motivation of our 
model. 

The last oracle of our model returns the dependency sets of already revealed elements. 

Definition 3.2. Online supermodular oracle of a set function f : 2 ^ —>• R + is the following: 

Input: An element u € M that has already been revealed. 

Output: Vf(u). 

Observe that the last oracle can, in fact, be implemented using the online marginal oracle, albeit 
using an exponential time complexity in n. However, this oracle is a natural online variant of the 
supermodular oracle, and thus, it emphasizes the relation between our model and previous work on 
the supermodular oracle. If time complexity is not a priority, as is often the case when analyzing 
online algorithms, then every use of this oracle can be replaced by an appropriate procedure using 
the online marginal oracle. 
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Additional Notation. In the rest of this paper we use the notation N u to denote the set of 
elements revealed up to the point in which a given element u £ JV is revealed (i.e., J\f u contains u 
and every other element revealed before u). 

3.1 Our results 

In this section we formally state our results for the model introduced above. Our first result is for 
instances where the matroid M is of rank k < T> ^ + 1. This setting is interesting for two reasons: 
it is closely related to the classical secretary problem, and our algorithm for it is used as a building 
block in our algorithms for other settings. 

Theorem 3.1. There exists an O(klogk) = 0(T>~jl log T>J)-competitive algorithm for the monotone 
matroid secretary problem when the rank of the matroid constraint M is k < T>J + 1. 

£)+ 

The time complexity of the above algorithm (and all our other algorithms) is Poly(n, 2 f ). The 
exponential dependence of the time complexity on T> ^ is unavoidable even for offline algorithms 
and a uniform matroid constraint (see [13] for more details). Our main result is given by the next 
theorem. 

Theorem 3.2. There exists an 0(T>~j 3 log T> + D)S~ 2 log k)-competitive algorithm for the monotone 
matroid secretary problem. 

Interestingly, the above competitive ratio matches the, till recently, state of the art ratio of 
0(log k) by Gupta et al. [2D] for the case where / is a submodular function, and extends it to any 
constant supermodular degree. For uniform matroids we have the following improved guarantee. 

Theorem 3.3. There exists an 0(T>^ \ogT>^)-competitive algorithm for the monotone matroid 
secretary problem when the matroid M is uniform. 

Note that the guarantee of Theorem 13.31 has no dependence on k , and thus, it yields a constant 
competitive ratio for a constant supermodular degree. 

It is handy to assume that / is normalized (i.e., /(0) = 0). Reduction [1] in Appendix lAl shows 
that this assumption is without loss of generality, and thus, we implicitly assume it in all our proofs. 


Lower Bounds. Note that even the offline version of maximizing a function / with respect to ma¬ 
troid constraint is AfP-hard to approximate within a guarantee of U(lnT>J/Uj) (see, e.g., [131:21]). 
Moreover, for a uniform matroid constraint, it is SSE -hard to achieve any constant approximation 
guarantee (even) in the offline setting m- 


3.2 Related results 

Secretary problem. Many variants of the secretary problem have been considered through¬ 
out the years, and we mention here only those most relevant to this work. Under a cardinality 
constraint of k, Babaioff, Immorlica, Kernpe and Kleinberg [3] and Kleinberg [23] achieve two in¬ 
comparable competitive ratios of e and 1/[1 — 0(1/V^)], respectively, for linear objective functions. 
For submodular objective functions, the best algorithms have a competitive ratio of 8e 2 ~ 59 for 
the general case [j6] and a competitive ratio of (e 2 + e)/(e — 1) « 5.88 when the objective is also 
monotone M- 

4 That is, a guarantee that does not depend on Df. 
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The matroid secretary problem considers a linear objective and a general matroid constraint. 
This variant was introduced by Babaioff, Immorlica and Kleinberg [5], who described an 0(log/c)- 
competitive algorithm for it (where k is the rank of the matroid) and conjectured the existence of 
an 0(l)-conrpetitive algorithm. Motivated by this conjecture, 0(l)-conrpetitive algorithms have 
been obtained for a wide variety of special classes of matroids including graphic matroids 0 [25] , 
transversal matroids [3121125], co-graphic matroids [28], linear matroids with at most k non-zero 
entries per column |28| . laminar matroids mmm, regular matroids m, and some types of de¬ 
composable matroids, including max-flow min-cut matroids m- However, progress on the general 
case has been much slower. An 0(\/log /^-competitive algorithm was described by Chakraborty and 
Lachish [8], and very recently two 0(log log /^-competitive algorithms were given by Lachish [26] 
and Feldman, Svensson and Zenklusen m 

The submodular variant of the matroid secretary problem was also considered. For general 
matroids [20] gave an 0(log /^-competitive algorithm, and 0(l)-competitive algorithms were de¬ 
scribed for special classes of matroids including partition matriods mmm and transversal and 
laminar matroids [27]. A recent work [18] shows that any algorithm for the linear variant can 
be translated, with a limited loss in the competitive ratio, into an algorithm for the submodular 
variant. This implies an 0(l)-competitive algorithm for the submodular variant under any class 
of matroids admiting such an algorithm for linear objectives, and an 0(log log/c)-competitive al¬ 
gorithm for general matroids. A secretary problem with an even more general family of objective 
functions was considerd by Bateni, Hajiaghayi and Zadimoghaddam [6] who proved an hardness 
result for the family of subadditive objective functions. Finally, variants of the matroid secretary 
problem which use a different arrival process or a non-adversarial assignment of element values 
were also considered mmm- 

Complexity measures of set functions. Complexity measures of set functions have been previ¬ 
ously studied. Abraham, Babaioff, Dughmi and Roughgarden IT] studied the welfare maximization 
problem with respect to a complexity measure that gives greater values to set functions (i.e.. mark 
them as more “complex”) as their complementarity increases, in some sense. This complexity mea¬ 
sure is applicable only to a restricted class of set functions. The notion of supermodular degree, 
which we use in this work, was introduced by [14] . again, with an application to the welfare maxi¬ 
mization problem, and was later used for a more general appication by m- A stronger complexity 
measure for set functions was studied by [13]. However, their results assume access to a demand 
oracle (see Blumrosen and Nisan 0 ). which is common in the context of combinatorial auctions, 
but, to the best of our knowledge, has not been used outside this world. 


Complements in online settings. A different online setting exhibiting complements can be 
found in the work of Ernek, Halldorsson, Mansour, Patt-Shanrir, Radhakrishnan and Rawitz on 
online set packing fT2] . 


4 Small rank matroids (Theorem 13.11) 


In this section, we describe the main intuitive ideas behind the proof of Theorem l3.il The proof itself 
is deferred to Appendix [B] For simplicity, we assume in this section that M is a uniform matroid 
of rank d + 1. The extension to general matroids and smaller ranks is quite straightforward. 

A natural generalization of the algorithm for the classical secretary problem is the following 
algorithm. First, during the learning phase, reject the first 0(n/d ) elements. From the remain¬ 
ing elements, take the first one whose marginal contribution with respect to some of its future 
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supermodular dependencies (i.e., the elements that may increase its marginal contribution and the 
algorithm can still choose to take) is better than any such contribution inspected thus far. 

It is not difficult to argue that the best marginal contribution seen by the above algorithm 
is always at least f(OPT)/(d + 1). However, to get a competitive ratio guarantee, we need to 
show that the algorithm manages to pick this best contribution with a significant probability. One 
approach to proving this claim is by generalizing the analysis of the classical secretary algorithm 
to this more general algorithm. Such a generalization requires lower bounding the probability that 
the following two events occur (at the same time). 

• The element with best marginal contribution arrives after the learning phase. 

• The second best marginal contribution, up to the point where we see the best contribution, 
is seen during the learning phase. 

Unfortunately, it is difficult to bound the above probability due to the following phenomenon. 
The earlier an element arrives, the more future supermodular dependencies it has, and thus, the 
higher its corresponding marginal contribution. Hence, elements in the learning phase tend to have 
larger marginal contributions in comparison to elements appearing after the learning phase. 

To overcome this issue, we modify the algorithm. Specifically, instead of comparing the marginal 
contribution of the current element to the marginal contributions seen thus far, we compare it to 
the marginal contributions that the elements seen thus far could have if they would have arrived at 
this time (instead of the time in which they have really arrived). A similar idea has been previously 
used by a work on the case of a submodular objective function ng. 

Additionally, to get the exact approximation ratio guaranteed by Theorem 13.11 the algorithm 
has to use a random threshold from a logarithmic scale. This allows the analysis to assume that 
(with a significant probability) the learning phase takes about half of the time up to the point when 
the best contribution is observed by the algorithm. 

5 Estimation aided algorithms 

We say that a value opt Q is an cr-estimation of an optimum solution OPT if it obeys f(OPT)/a < 
opt a < f{OPT). We say that an algorithm is a-aided if it assumes getting an cc-estimation of 
the optimum as part of its input. In this section we describe an aided algorithm for the case of a 
general matroid constraint. An improved aided algorithm for the special case of a uniform matroid 
constraint can be found in Appendix [Dj In the next section we explain how to convert our aided 
algorithms into non-aided ones. Note that our aided algorithms work even under a model where 
the arrival order is determined by an adversary. However, the randomness of the input is required 
for converting them into non-aided algorithms. 

Theorem 5.1. For every a > 1, there exists an a-aided 0{d? log(afc))- competitive algorithm for 
the monotone matroid secretary problem. 

The algorithm we use to prove Theorem 15.11 is Algorithm |T] A few of the ideas we use in 
Algorithm [T| and its analysis can be traced back to [5]. 

We define a weight w(u) for every element u E OPT as follows: w(u) = f(u \ OPT \ J\f u ). For 
ease of notation, we extend w to subsets of OPT in the natural way. Let us denote p\ = — [log 2 k~\ 
and p 2 = |~log 2 aT|. For every integer p\ < p < p 2 , we define a set (bucket) OPT p = {u € OPT \ 
2 p ■ < w(u) < 2 p -opt Q }. Intuitively speaking, the following lemma shows that there is sufficient 

value in all the buckets together. The lemma holds since every element that does not get into any 
bucket must have a very low weight. 

Lemma 5.2. w (lj P P = Pl OPT^j > 7 (° 2 PT ) . 
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Algorithm 1: a-Aided Algorithm for General Matroids 


1 

2 

3 

4 

5 

6 
7 


Let p be a uniformly random integer from the set {— [~log 2 k~\ — 3, — |dog 2 k~\ — 2,..., |dog 2 a]}. 
Let r «- 2P ■ 

Let S <— 0. 

for every arriving element u do 

if there exits a set D*(u) C V + {u) \M U such that f{u \ D*(u) U S) > r and 

S U D*{u) + u € X then 

Add D*(u) + u to S. 


8 return S. 


We defer the proof of Lemma 15.21 and the other lemmata of this section to Appendix [Cj Our 
next objective is to show that if Algorithm |T] selects a value p, then its gain is proportional to 
w(OPT p+ 3 ). Whenever S appears below it denotes the output of the algorithm. 

Lemma 5.3. If Algorithm Q] selects a value p and |Sj > | 0 PT p+ 3 |/[ 2 (<i + 1)], then f(S ) > 
w(OPT p+3 )/[32(d + l) 2 }. 

Intuitively, the last lemma holds since a large \S\ means that the algorithm adds elements to S 
in many iterations, and each iteration increases f(S) by at least r. 

Lemma 5.4. If Algorithm Q selects a value p and |Sj < \OPT p+3 \/[2(d + 1)], then f(S) > 
w(OPT p+3 )/ 8 . 

The main idea behind the proof of Lemma [5.41 is as follows. Since |Sj is small, many elements of 
OPT p+3 \S could be added to it, together with their dependencies, without violating independence. 
The reason these elements were not added must be that they did not pass the threshold, which can 
only happen when f(S) is large enough. 

Corollary 5.5. If Algorithm^ selects a value p, then f(S) > w(OPT p+3 )/[32(d + l) 2 ]. 

We are now ready to prove Theorem 15.11 

Proof of Theorem, I5.il Recall that every value p is selected by Algorithm |T] with probability at least 
(log 2 a + log 2 k + G)^ 1 = (log 2 (afc)+ 6) _1 . Hence, by Corollary 15.51 the expected value of the output 
of Algorithm [T] is at least: 

1 A w(OPT p+3 ) _ w (l J P 2 =P1 OPT p ) f(OPT) 

\og 2 (ak) + 6 32 (d + l ) 2 32 (d + l ) 2 • [log 2 (afc) + 6] “ 64 (d + l ) 2 • [log 2 (ak) + 6] 

where the last inequality is due to Lemma 15.21 □ 

6 Estimating the optimum: from aided to non-aided algorithms 

In this section, we show how to convert aided algorithms into non-aided ones. Together with our 
aided algorithms, the following theorem implies the results stated in Theorems 13.21 and 13.31 

Theorem 6.1. If there exists a (80(d + 2) 2 )-aided ft-competitive algorithm ALG for the monotone 
matroid secretary problem with supermodular degree d, under a class C of matroid constraints closed 
under restriction, then there also exists a non-aided 0(d 3 log d + (3)-competitive algorithm for the 
same problem. 










Recall that the truncation of a matroid M = (, Af , X) to rank k' is a matroid M 1 = (J\f, X') where 
a set S' C AA is independent in M' if and only if S € X and |S| < k!. The algorithm we use to prove 
Theorem EH is Algorithm [3 

Algorithm 2: Multiple Elements Estimation 


1 

2 


with probability 1/2 do 

Apply the algorithm guaranteed by Theorem 13. II to the problem after truncating the 
matroid to rank minjfc, d + 1} (where k is the rank of the original matroid). 


3 

4 

5 

6 

7 

8 
9 


otherwise 

Choose X according to the binomial distribution B(n, (d + 2) _1 ), and let T the set of the 
first X elements revealed. 

Let A <— 0 and W 4— 0. 

while there exist u £ T\ A and T> u C T> + (u) s.t. A U T> u + u € X do 
Find such a pair maximizing f(u \ A U T> u ). 

Increase W 4— W + f(u \ A U T> u ). 

Update A 4— A U V u + u. 
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Apply ALG to the remaining elements with opt 80 (d + 2)2 = W/ 10. 


Algorithm [2] consists of two parts, each executed with probability 1/2. In order to prove 
Theorem 16.II we show that for any instance of the monotone matroid secretary problem, one of the 
following cases is true: 

• The algorithm guaranteed by Theorem 13.11 produces an 0(d 3 log d)-competitive solution for 
the non-truncated problem. 

• The second part of Algorithm [2] is O(/^-competitive. 

To determine which of the above cases is true for every given instance we need some notation. 
Let u* * £ J\f be an element maximizing 

max f(u* I S) , 

SCV+(u*) 

s+u*ei 

and let m* and S* denote the value of this maximum and an arbitrary corresponding set S, respec¬ 
tively. It can be shown quite easily that the first case above holds when m* > f{OPT) /(256(d-|-l) 2 ) 
(see Lemma lC.ll for details). Thus, we sketch here only the more interesting part of the analysis, 
which is to show that the second above case holds when m* < f(OPT)/(256(d + l) 2 ) (a full proof 
can be found in Appendix [E|). The main thing that we need to show is that optso(d-|- 2 ) 2 with 
constant probability, a 80(d+ l) 2 -estimation for f(OPT). For that purpose, we relate the expected 
value of the estimate opt 80 ( rf+2 ^2 to f(OPT), and then bound the variance of this estimate to show 
that it is close enough to its expected value with constant probability. 

In the rest of this section we assume Algorithm [2] executes its second part. Let Wg and Ag 
be W and A, respectively, when Algorithm [2] exits its loop. We bound Wg , which immediately 
implies bounds for opt 80 (d_|_ 2 ) 2 • I R order to achieve that, we switch our attention from Algorithm [2] 
to an offline algorithm (Algorithm [3]) producing exactly the same distribution for Wg. We explain 
intuitively why the distributions of Wg in both algorithms are the same. Let us choose a set T pre 
ahead, exactly as T is chosen by the online algorithm. Next, we modify the offline algorithm so 
that whenever it chooses an element from its set T, instead of randomly deciding whether to keep 
it in T, it queries membership in T pre . Clearly, since there are no repetitions in these queries, this 
does not affect the behavior of the offline algorithm. However, one can verify that the output of the 
offline algorithm is now identical to the output of the online algorithm when it selects T = T pre . 
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Algorithm 3: Offline W Calculation 


Let A <— 0, W <— 0 and T <— JV. 

while there exist u € T\ A and T> u C T> + (u) s.t. A U T> u + u € X do 
Find such a pair maximizing f{u \ A U T> u ). 
with probability (d + 2) -1 do 

Increase W <— W + f(u \ A U V u ). 

Update A <— A U V u + u. 

otherwise Update T ■(— T — u. 


Let Le be the sum of f(u \ A U T> u ) for all the iterations done by Algorithm [3l regardless of the 
random choice made by the algorithm. We show how to lower bound the expectation of Li with 
respect to f(OPT), and then use a bound on the variance of Wf to get a concentration result for 
W e . 

Lemma 6.2. (d + 1) ■ Le > f(Ai). 

Brief sketch of proof. The proof is by induction on the number of iterations. Assume the lemma 
is true for i — 1 iterations, and let us prove it for iteration i. Trivially, if Algorithm [3] randomly 
chooses not to add elements to A, then the lemma is true for iteration i as well. Now, assume the 
random choice made is to add the elements to A. Note that only up to d + 1 elements are added 
to A, and therefore, it is sufficient to show that the marginal value of each is upper bounded by 
the increase in the value of L. Let u be the element chosen by the algorithm. Any dependency 
of u that appears in T could also be chosen instead of u, so its marginal is upper bounded by 
the increase in the value of L (since the algorithm uses a greedy choice). On the other hand, any 
dependency v! qLT must have been removed when chosen by the algorithm in a previous iteration. 
Therefore, its marginal value, computed with respect to its optimal dependencies in this previous 
iteration, is already counted by Lg. Note that the last marginal value must be at least as large 
as the marginal value of v! with respect to the optimal dependencies in the current iteration (by 
definition of supermodular dependencies). □ 

Lemma 6.3. f(Ae) + (d + 1) • Li > f(OPT). 

Brief sketch of proof. We consider a hybrid solution starting as OPT and ending as A%. We use the 
matroid augmentation property to observe that, when a new element of A is added to this hybrid 
solution, no more than d + 1 non A elements have to be removed to restore independence. Then, 
we bound the “damage” resulting from the removal of these elements using the greedy choice of 
the algorithm and the definition of supermodular dependencies. Finally, we observe that the value 
of Ai cannot be increased by adding elements that are still in T, since, otherwise, the algorithm 
would have done that. This means that An itself is as good as the final hybrid (which contains 
it). □ 

An immediate corollary of the last two lemmata is a lower bound of f(OPT)/(2(d + 1)) on 
Li. This bound, together with a concentration result we prove in Appendix [El shows that Wg > 
f(OPT)/0(d 2 ) with constant probability. The last inequality implies with constant probability, 
when m* < f(OPT)/(256(d+ l) 2 ), that opt 80 (d + 2)2 = VU/10 is indeed a (80(d + 2) 2 )-estimation for 
the optimum of the part of the input that was not read by the estimation algorithm (Le., the input 
for the aided algorithm). The competitive ratio of the second part of Algorithm [2] then follows from 
the competitive ratio of the aided algorithm. 
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A Assuming our set functions are normalized is without loss of 
generality 

Reduction 1. If ALG is an a-competitive algorithm for the monotone matroid secretary problem 

under the assumption that f is normalized, then ALG is an a-competitive algorithm also without 

this assumption. 
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Proof. Let g: 2^ —>• M + be the function g(S) = f(S) — /(0). Notice that g is a non-negative 
monotone function and Pj = P,j". Moreover, all the oracles that an algorithm for the monotone 
matroid secretary problem has access to return the same answers for both / and g , and thus, the 
algorithm produces a random set S with the same distribution when given either / or g as input. 
Since g is normalized, by the definition of ALG : 


ms)} > 


g(OPT ) 


a 


where OPT is a set maximizing g (and /). Thus: 


E[/(£)] = E \g(S)} + f(0) > 9{OPT) + /(0) > /(OPT) 

a a 


□ 


B Small rank matroids (Proof of Theorem 13.11) 

In this proof we need some additional notation. The max-marginal of an element u at time i is 
the largest marginal value that u can contribute to a subset of the elements that arrive after time 
i (while keeping the subset independent). More formally, let A/) be the (random) set of the first i 
elements that arrived, then the max-marginal of an element u at time i is: 

m ma x(M) = max f(u \ S ) . 

SCAf\Mi 
S T u^L 

We also use S max (u, i) to denote an arbitrary set for which the maximum is obtained. Note that one 
can calculate both m max (u,i) and S max (u,i) in 0(2 d ) time. We begin the proof with the following 
simple claim. 

Claim B.l. We may assume that n is dividable by any quantity h whose value is polynomial in k. 

Proof. Let n' be the least multiple of h which is at least as large as n. Note that n! is polynomial 
in n (since k < n). Let A/ 7 be a ground set containing the elements of A f and a set D of n' — n 
dummy elements. We extend / and M to A/ 7 as follows: 

• The function /': 2^' —>■ M + is defined as: f(S) = f(S \ D) for every set S C A/ 7 . Note that 

f is non-negative, monotone and has a supermodular degree of d. Additionally, T>^,{u) = 0 
for the dummy elements of D, and P^",(m) = for every other element. 

• The matroid M' = (JV 1 ,! 1 ) is defined by the following rule. A set S C M' is in T' if and only 
if S \ D € X and IS) < k. Note that this rule defines a matroid of rank k which is uniform 
whenever M is. 

One can observe that the problems (/, M) and (/',Af') are equivalent in the sense that: any 
solution for (/, M ) is also a solution for (/', M') of the same value, and removing the dummy 
elements of any solution for (f, M') results in a solution for (/, M ) of the same value. Moreover, 
given access to the oracles corresponding to (/, M), one can efficiently implement the oracles for 
(/', M'). Thus, given algorithm ALG that is r-competitive for ground sets obeying the requirements 
of the reduction, one can construct an r-competitive algorithm for general ground sets as follows: 

1. Apply ALG to the instance ( f,M'). 

2. Accept every element of A f = A/ 7 \ D that ALG accepts. □ 
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In the rest of this section we make two assumptions. First, we assume that n is dividable 
by 10fc, which is justified by Claim IB.II Second, we assume k > 2 (if k = 1, then the classical 
secretary algorithm can be used to get an 0(l)-competitive algorithm). Our objective is to show 
that Algorithm 0] obeys the requirements of Theorem 13.11 given these assumptions. 

Algorithm 4: Small Rank Matroid 


Select an arbitrary order -< over the elements of the ground set Af. 

Let p be a uniformly random integer from the set {0, 1,... , [log 2 k~\}. 

Reject the first t = 2 P ■ ^ elements, 
for i = t + 1 to n do 

Let Ui be the element arriving at time i. 

if for every element u £ A/)_i either m max (tq,z) > m max (u, i) or 

f m max(' Ui,i) = m max (u, i) and, Ui >- u) then 
Terminate the “for” loop and accept the elements of S max («j,z) +Ui when they arrive. 


For the purpose of analyzing Algorithm 01 it is helpful to think about the input as created 
backwards by the following process. The set Af n is simply the entire ground set A f. Then, the 
last element of the input u n is selected uniformly at random from Af n , and the set Af n -\ becomes 
Af n — u n . On the next step, the (n— l)-th element u n -\ of the input is selected uniformly at random 
from Af n -\ and we set Af n -2 = Af n -\ — u n _\. The process than continuous in the same way, i.e., 
when it is time to determine the i-tli element of the input, this element is selected uniformly at 
random from A/), and we set A/)_i = A/) — Uj. 

We say that an element u is the top element of a set A/) if for every other element v! £ Afi\u either 
nimax^)*) < rn max (u, i) or m max (u / ,z) = m max (tt,i) and u' X u. Note that Line[7]of Algorithm [5] in 
fact checks whether Ui is the top element of A/). Additionally, we say that an input is well-behaved 
with respect to the value t and order X chosen by Algorithm 0] if it has the following properties: 
(Al) There exists a time i > n/k such that for some element u £ A/), nn max (u,z) > f (OPT)/k, 
where OPT is an independent set maximizing /. We denote the first such time by t\. 

(A 2 ) There exists exactly a single time t < i < £\ such that tq is the top element of A/)@ We 
denote this time by £ 2 - 

The analysis of Algorithm 0] consists of two parts. First we show that it produces a good 
output for well-behaved inputs, and then we show that the input is well-behaved with a significant 
probability. 

Lemma B.2. Algorithm [7] outputs a solution of value at least f(OPT)/k when its input is well- 
behaved with respect to t and X. 

Proof. The definition of the algorithm and Property A[2] guarantees that the algorithm outputs 
Smax(^ 2 ’ ^ 2 ) + The value of this solution is: 

/(Smax (^£2 > ^ 2 ) T IR2) — ./" I S max (u ^ 2 , £ 2 )) — el max ( ^£2 ■ ^2 ) ■ 

Hence, we are only left to lower bound m ma , x (u^ 2 ,£ 2 )■ Observe that by Property A0J there exists 
an element u'g £ A /4 such that m max (tt £ 1 , ^ 1 ) > f(OPT)/k. Let us prove by a backward induction 
that this is true for every £2 < i < £\, i.e., that for every such time there exists an element u[ € A/j 
such that m tnax (u), i) > f(OPT)/k. 

Assume the claim holds for a given £2 < i < t\, and let us prove it for i — 1 . Observe that 
we can assume without loss of generality that u[ is the top element of A/). Thus, by Property A[ 2 ] 

5 Note that in some cases we might have li < t. In these cases the input is not well-behaved with respect to t 
and X. 
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u[ 7 ^ Ui, which implies: v! i € N % -\. By definition m max (u, i) is a non-increasing function of i, hence, 
rn ma — 1) > m max ('(i’, i) > f(OPT)/k, which complete the induction step. 

The claim we proved by induction implies: m max (it^ , ^ 2 ) > f(OPT)/k. The lemma now fol¬ 
lows by observing that Property .AO guarantees that ug 2 is the top element of Mp 2 , and thus, 
^max(^fc)^) P ITImax^ji^)' I—I 


Lemma B.3. Property d[7 \ holds with a probability of at least 0.2. 

Proof. Given an element u E JV, let i u denote the time when it arrives. Then, 
E m max (M„)> E f( u \OPT\AT u ) = f(OPT) , 

uGOPT u&OPT 


where the inequality follows from the definition of m max . Since \OPT\ < k, we get by averaging 
that for some element u € OPT there must be m max (u, i u ) > f (OPT)/k. Hence, Property A[T]is 
guaranteed to hold when all the elements of OPT appear after time t = n/k. The last event occurs 
with a probability of at least: 


' ( t )) ' ( n _ (n — k)\ ■ (n — t)\ _y-r n — k — i 

n\ n\-{n — k — t)\ M n — i 


> 


n — k — t 


\ n — t I 
> e -i/o.9 .( 1 _ 


= 1 - 


n — t 


> 1 - 


k 

0.9 2 • n 


k \ n/k 


0.9n 

1 


> e" 10 / 9 • f 1 - _L ) > 0.2 


□ 


Lemma B.4. Given that Property d[I \holds, Property J^holds with a probability of at least (log 2 k+ 
2) _ 1 /4. 


Proof. First, let us consider the event E\ that there exists a time l\/2 < < l\ such that is 

the top element of J\fp 2 and for every time < i < £\, Ui is not the top element of J\f % . For this 
event not to occur, a non-top element m must be selected from TV",; for every time £\/2 < i < 
which happens with probability: 


n 


i=L£i/2+lJ 



LV2J < 1 

h ~ 2 


Hence, E\ occurs with the complement probability, which is at least 1/2. Next, given that E\ 
occurred, we are interested in the event that T 2 f2 < t < £ 2 , which we denote by E 2 . It is important 
to notice that E\ is independent of the choice of t by the algorithm, and thus, the distribution of t 
is unaffected by conditioning on E\. Additionally, notice that: 


2 L = IL < l ± < g 
2k 2k ~ 2 < 2 


and 


< r f K 2 riog 2 fci. 

2 ~ 2 ~ 2k 


Hence, one of the possible values of t obeys the requirement £' 2 /2 < t < 1' 2 . Since t takes at most 
log 2 k + 2 different values, and it takes them with equal probabilities, we get that E 2 occurs with 
a probability of at least (log 2 k + 2 ) -1 given E\. 

Given that E\ and E 2 both occur, for Property A0 to hold with need the additional event that 
in the range (t,l 2 ) no element Ui is the top element of TV/. Note that the order of the elements of 
J\fgi — up 2 is independent of E\ and E 2 . Hence, the probability of this event is at least: 



i=t -\-1 


l-l 


i 


t ^ 2/2 

f 2 -1 - 1> 2 -1 


> 1/2. 


□ 
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We are now ready to prove Theorem 13.11 


Proof of Theorem 13.11 Lemmata IB.3I and IB.41 imply that the input is well-behaved with respect to 
t and -4 with probability at least (log 2 k + 2) - 1 /20. By Lemma IB. 21 when the input is well-behaved 
with respect to t and -4, Algorithm 0] outputs a solution of value at least f(OPT)/k. Hence, the 
competitive ratio of Algorithm 0 is at least: 

20 &:(log 2 k + 2 ) = 0(k log k) . □ 


C Missing proofs 

This section contains proofs that have been omitted from the main body of the paper. 

Proof of Lemma \5.°A Clearly w(OPT) = f(OPT). Moreover, by definition, 2 Pl -opt a < f(OPT)/k. 
Hence: 


f(OPT) - w ( 0 OPT p ) = Y. w ( u ) < k ' ( 2P1 ' 0 P t a/ 2 ) < 


f(OPT) 


□ 


\p=p i 




wGOPT 

w(u)<2 p l -opt a /2 


Proof of Lemma 15.31 For an element u E Af, let S u be the set S immediately before u is processed 
by Algorithm [I] Note that each time that Algorithm [T] adds elements to S, it adds up to d + 1 
elements and f(S) increases by at least r since, by monotonicity: 

f{D*{u ) + u | S u ) > f(u | D*(u) U S u ) . 

Hence, we can lower bound f(S) by: 


f(S)> 


|g| ' 

d + 1 


r > 


\OPT p+3 \ [1 

2(d + l ) 2 [l 6 uedPT p+ 3 


max 


w(u ) 


> 


w(OPT p+3 ) 
32(d + l ) 2 


□ 


Proof of Lemma [5171 Observe that OPT p+3 is a subset of OPT, and thus, independent. Hence, by 
the matroid properties, there exists a set O' C OPT p+ 3 \ S of size at least |O.PTp + 3 | — |5| such that 
O' U S € X. Every element u E OPT p+3 \ O' can belong to the dependence set of at most d other 
elements of OPT p+ 3 . Thus, the number of elements u E OPT p+ 3 having T> + (u) n OPT + u % O' is 
upper bounded by: 

(d+ i).|si<!£S±2i, 

In other words, there exists a set O" C OPT p+ 3 of size at least \OPT p+3 \/2 such that V + (u ) n 
OPT + u C O' for every u E O". Observe that by monotonicity: 

f{0') > Y f( u I 0'\Af u ) > Y f( u I OPT\Af u ) 

u eO" ueo" 


= w(0") > 


\opt p+3 \ 


min w(u) > 

uGOPT p+3 


w(OPT p+3 ) 


where the second inequality holds since O' already contains all the elements of T> + {u ) D OPT. 

Every element u E O' must have been rejected upon arrival by Algorithm [T| due to the threshold. 
Moreover, for every such element u we have ([V + (u) n (O' U 5)] \Af u + u) U S u C O' U S E 1 (where 
S u is, again, the set S immediately before u is processed by Algorithm [ 1 ]). Hence: 


f(u | (o' \ M u ) u s)< f(u | [v+(u) n (o' u 5)] \M u us u )<r , 
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where the first inequality holds by the definition of T> + (u). Adding the last inequality over all 
elements u € O' gives: 

w(OPT p+3 ) < /(0 , } < /(0 / us) = f{s) + £ /(„ | (O' \ AQ U 5) < /(SO + |OPr p+3 | • r 
4 ueo' 

< m + IOPT p+3 | ■ < /(S) + «±2> , 

The lemma now follows by rearranging the last inequality. □ 

Lemma C.l. If m* > f(OPT)/[256(d + l) 2 ], then Algorithmic is 0(d? log d)-competitive. 

Proof. Note that S* + u* is an independent set in the matroid even after it is truncated to rank 
minjfc, d+ 1}. Hence, when Algorithm [3] applies the algorithm guaranteed by Theorem 13. II fwhich 
happens with probability 1/2), the produced set has an expected value of at least: 

f(S* + u*) > f(u* 1 S*) m* f (OPT)/[256(d + l) 2 ] 

0{d\ogd) ~ O(dlogd) O(dlogd) ~ O(dlogd) 

D Estimation aided algorithm for a uniform matroid constraint 

In this section we prove the following theorem. 

Theorem D.l. For every a > 1, there exists an a-aided 0(dlog a)-competitive algorithm for the 
monotone matroid secretary problem when the matroid M is uniform. 

Before proving the existence of an a-aided algorithms for any a > 1, let us begin with a 2-aided 
algorithm. 

Proposition D.2. There exists a 2-aided 0(d)-competitive algorithm for the monotone matroid 
secretary problem when the matroid M is uniform. 

The algorithm we use to prove Proposition ID.21 is Algorithm EJ 


Algorithm 5: 2-Aided Cardinality 

l 

Let r Tt- 


2 

Let S i — 0. 


3 

for every arriving element u do 


4 


if there exits a set D*(u) C V + (u) 

\ Af u such that f(u \ D*(u) U S) > t and 

5 



S| + \D*(u)\ + 1 < k then 

6 


Add D*(u) + u to S. 


7 

return S. 



Let S u be the set S before the element u is processed. When S appears below without a 
subscript it denotes the output of the algorithm. 

Lemma D.3. If |5| < max{0, A: — d — 1}, then f(S) > f(OPT)/2. 
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Proof. Assume, towards a contradiction, that |5| < max{0, k — d — 1} and still f(S) < f(OPT)/ 2. 
Since |S| < max{0, k — d — 1}, for every element u € OPT \ S Algorithm [5] could add the set 
['D + (u ) fl ( OPT U 5)] \ Af u + u to S. From the fact that the algorithm did not add this set (or any 
other set containing u) to S, we learn that: 

f(u I (OPT \ A f u ) u S)< f(u I p v + (u ) n (OPT U S)j \Af u U S U )<T , 

where the first inequality holds by the definition of T> + (u). Adding the last inequality over all 
elements u € OPT \ S gives: 

f{OPT) < f(OPT U S) = f(S ) + ]T f(u | (OPT \ Af u ) U S) < f(S ) + kr . 

u£OPT 


Plugging the assumption that f(S) < f(OPT)/2 and the definition of t into the last inequality 
gives an immediate contradiction. □ 


Lemma D.4. If |S| > max{l, k — d}, then f(S) > f {OPT)/ [8(d + 1)]. 

Proof. Note that each time that Algorithm [5] adds elements to S , it adds up to d + 1 elements and 
f(S) increases by at least r since, by monotonicity: 

f(D*(u) + u | S u ) > f(u | D*(u ) U S u ) . 


Hence, we can lower bound f(S) by: 


m> 


> 

r \s\ i 

• r > 


d+ 1 



maxjl, k — d} 
d+ 1 


opt 2 ^ fc f(OPT) 
2k ~ 2 (d + 1) ' 4k 


f(OPT) 
8 (d + 1) 


□ 


Proposition ID.21 follows immediately from the last two lemmata. Theorem ID. II generalizes 
Proposition ID.21 to general cc-aided algorithms. The algorithm we use to prove Theorem ID.II is 
Algorithm [6j 


Algorithm 6: cc-Aided Cardinality 

1 Let p be a uniformly random integer from the set {0,1,... , [log 2 af|}. 

2 Apply Algorithm [5] with opt 2 = 2 P ■ opt Q . 


Proof of Theorem \D.1\ The largest value Algorithm O can assign to opt 2 is: 

2fi°g«l . op t Q > a • ( f(OPT)/a ) = f(OPT) . 

On the other hand, the smallest value Algorithm [HI can assign to opt 2 is: 2 0 -opt Q < f(OPT). Hence, 
for some p Algorithm [6] is guaranteed to produce a value opt 2 obeying f(OPT )/2 < opt 2 < f(OPT ), 
in which case Algorithm 0 is 0(d)-competitive by Proposition ID.2I Since every value of p occurs 
with a probability of at least l/(log 2 a + 2), we get that the competitive ratio of Algorithm [6] is at 
most 0(d log a). □ 

E Full proof for m* < /(OPT)/(256(d + l) 2 ) 

In this section we analyze Algorithm [2] in the case of a small m*. We begin with a concentration 
result proved in Section IE. II The analysis of Algorithm [2] appears in Section IE.21 
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E.l Concentration result 


In this section we study a stochastic process consisting of rounds. In each round i > 1, a positive 
value Xj E (0, B\ (for some parameter B > 0) arrives, and is flagged “accepted” with a probability 
p E [0, 1], independently, and “rejected” otherwise. The value Xj itself might depend on the way 
previous values have been flagged, but not on the way Xj itself is flagged. More formally, let A be the 
set of indexes corresponding to accepted values, then Xj is a function of the set An {1,2,..., i — 1}. 
The process terminates after T > 1 rounds, where T itself might depend on the way values have 
been flagged. However, it is guaranteed that T is upper bounded by a finite integer T and: 

T 

Y X ^ L 

i =1 

for some parameter L > 0. 

Let T,a = J2ieA X i be the random sum of the accepted values. Our objective is to show a 
concentration bound for Ya- Let us first prove such a bound for the case when we make an 
additional assumption. 

Assumption E.l. Each value Xj is equal to B/2 J for some value j > 0. 

Using the above assumption, we can now define some additional notation. Let 5 be the smallest 
number such that some value Xj has a positive probability to take the value 5. Observe that 6 is 
well defined since the above process has only finitely many possible outcomes. By Assumption IE. ll 
every value Xj is a multiple of 5. 

It is helpful to think of the values Xj as intervals placed one after the other on the axis of 
real numbers, starting from 0. In other words, for every value Xj we have an interval starting at 
Jlj/=i x j and ending at Taking this point of view, every interval Xj can be partitioned 

into Xj/ 8 ranges of size 5. Let us associate a random variable with each one of these ranges. More 
formally, for every i > 1, let Ij bea random variable taking the value 1 when the range (<5(z — 1), 5i) 
is contained within an accepted interval Xj , and the value 0 in all other cases. 

Lemma E.2. Under Assumption I E. 11 Ya = 5 ■ y i- 

Proof. Fix a realization of the above process. Recall that Yj is zero whenever the range (5(i— 1), Si) 
is not contained within an accepted interval. On the other hand, consider an arbitrary accepted 
interval Xj. The interval Xj contains Xj/5 ranges of size 6. Moreover, all these ranges end at the 
point Y^J= l Xj < T ■ B or earlier, and thus, their variables appear in the sum on the right hand 
side of the equality we want to prove. Hence, in conclusion, the contribution of Xj to that sum is 
exactly Xj/5. The observation now follows since the intervals {Xj}J =1 are disjoint, and thus, so are 
their contributions to the sum. □ 

Let / be the minimal integer such that SI > L. 

Observation E.3. Under Assumption I E. li Ya > S ■ Y/l= i U- 

Proof. Notice that T • B is an upper bound on the sum Y/J=i Xj. On the other hand, L is a lower 
bound on this sum, and thus, we get: T ■ B > L. Hence, 5(T ■ B/8) > L. Since the term T ■ B/S 
is an integer, the minimality of I implies I < T -B/S. Using Lemma IE.2I and the fact that the 
variables Yj are non-negative, we get: 

T-B/6 I 

Ya = S- Y. Y t >S-Y y i. ■ □ 

i =1 i=l 
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Observation IE.31 shows that it is enough for our purpose to prove a concentration bound for 
Yi. The following observation gives another useful property of I. 

Observation E.4. Under Assumvtion I E. 1 1 for every 1 < i < I, the range ( 5(i — 1), <5z) is always 
contained within some interval Xj. 

Proof. Assume towards a contradiction that there is some realization of the process under which 
the range (S(i — l),<5i) is not contained within some interval Xj. This implies: 

T 

L < Y, x 3 < 8(i - 1) < 6(1 - 1) , 

3 = 1 


contradicting the definition of I. 


□ 


Let us now study the distribution of the variables {lj}f =1 . 

Lemma E.5. Under Assumvtion I E. 1 1 Pr[Yi = 1] = p for every 1 < i < I. 
expectation: 


E 




i 

E E M = p J • 
2=1 


Hence, by linearity of 


Proof. For every j > 1, let £j be the event that j < T and (6(i — 1), Si) is included in the interval 
Xj. Observe that £j depends only the acceptance of intervals Xj> for j' < j. Moreover, given that 
Xj exists it is accepted with probability p, independently of the acceptance of previous intervals. 
Hence, we get: 

Pr[l • = 1 | £j]=p . 

The observation now follows by the law of total probability since Observation IE.41 guarantees that 
(5(i — 1 ),Si) is included in some interval, and thus, the event £j happens for exactly a single value 

of j- ' □ 


For every 1 < h < B/8, let us define Vh = {1 < i < I \ i = h (mod B/8)}. Observe that the 
sets { V h }^\ form a disjoint partition of the indexes from 1 to I. 

Observation E.6. Under Assumption \Efl J for every 1 < h < B/8 and two different indexes 
i,i' E Vh, the ranges (S(i — 1 ),8i) and (8(i' — 1 ),8i') cannot be contained in one interval Xj. 

Proof. Assume without loss of generality that i <il . The definition of Vh guarantees that i + B/8 < 
i'. Hence, 

8i' — 8(i — 1) = 8(i' — i) + 8 > B + 8 . 

Hence, any interval Xj containing both ranges (8(i — 1), Si) and (8(i' — 1 ),8i') must be of length at 
least B + 8, which contradicts the definition of the process. □ 

Lemma E.7. Under Assumption I E. li for every 1 < h < B/8, the variables of {Yi \ i E Vh} are 
independent. 

Proof. For every i E Vh, let V^ 1 denote the intersection Vh D {1, 2,..., i — 1}. To prove the lemma 
it is enough to show that for every i E Vh the variable Y % takes the value 1 with probability p 
conditioned on any assignment to the variables of {Yj \ j E V/f 1 }. Let A denote an arbitrary such 
assignment having a non-zero probability. For every 1 < £ < T, let £i be the event that the interval 
X( exists and contains the range (5(i — 1), Si). 
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Assume £g happens. By Observation IE.61 the variables of {Yj \ j E Vff 1 } correspond to ranges 
contained in intervals before Xg. Thus, the values of these variables only imply information about 
the acceptance of these intervals. Since the acceptance of Xg is independent of the acceptance of 
previous intervals, we get that every 1 < i < T obeying Pr[£^ | A] > 0 must also obey: 

Pr[T* = 1 | Eg, A] = p . 

Clearly the events {£g}T = ] are disjoint. By Observation IE.4I we also know that one of them 
must happen. Hence, we get: 

f 

Pr \Yi\A]= Y ?r[£e\A}-Fr[Y i = l\Sg,A]=p-Y PT [ £ e\A}=P • □ 

1 <£<T t=\ 

Pr[£^|.A]>0 


Corollary E.8. Under Assumption I E. 11 for every 1 < h < B/S, Var J2iev h Pi — P{L/B + 2). 
Proof. By Lemma lE.51 for every i E Vh , Var[V] = p{ 1 — p) < p. Thus, by Lemma IE.71 


Var 


i£V h 


= Y Var[V]< YP = P- 1^1 ' 
iev h iev h 


The definition of Vh guarantees that its size is at most: 


< 

■ I ■ 

< 

L / <5+1 


B/d 


B/S 


L + S L 

~ B + 1 - B +2 


To bound the variance of the sum J+=i Pi, we need the following simple technical lemma. 
Lemma E.9. For a set of random variables Z\, Z%, ■ ■ ■ Zg, each having a finite variance, 


□ 


Var 


' £ 

E z - 

. 2=1 


< Y v Var ^] 


\ 2=1 


Proof. 


Var 


■ £ 

A =1 


l l 


l l 


Y.Ec°v\z i ,z j ]<Y.Y. yVar[Zj] • VarfZ,-] = (Y /Var [Zi] ) . □ 

1=1 7=1 i=l 7=1 \i=l / 


i=l j=l i=l j=l 

Corollary E.10. Under Assumption \E. 1[ Var 'ffY Y{ < pB5~ 2 (L + 2B). 
Proof. Observe that: 

I B/S 

'£ Y ‘ = Z'£ Y ‘- 

i=l h= 1 i€Vh 

Hence, by Corollary IE. 81 and Lemma IE.91 


Var 


E* 

•i=i 


< 


B 


/p|| + 2 


pB 


(L + 2B) . 


□ 


We are now ready to prove the promised concentration bound for Ea- 
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Lemma E.ll. Under Assumvtion \E. 1[ for every t > 0, PrfE^ < pL — t] < pB(L + 2 B)/t 2 . 
Proof. By Lemma IE. 31 


Pr[E^ < pL — t] < Pr 


< Pr 


5 - <pL -t 

i=1 
I 

5 • y~] Yj < 5 ■ pi — t 


2=1 


< Pr 


S-Y'Yi-S-pI 


2=1 


> t 


Since the expected value of 6 ■ J2i =i Yi is 6 ■ pi by Lemma IE.51 we get by Chebyshev’s inequality: 


Pr 


S-^Yi-S-pI 


2=1 


> t 


Var 


< 


8-ZLi Yi 


5 2 ■ Var 


Ef=i Y i 


2- < 


pB{L + 2 B) 


t 2 t 2 t 2 

where the last inequality holds by Corollary IE. 101 □ 

Finally, we would like to get a version of Lemma lE.111 that holds without Assumption lE.il 
Corollary E.12. For every t > 0, Pr[E^ < pL /2 — t] < pB(L + 2B)/(4t 2 ) 

Proof. For every value Xf, let us define a value X[ as follows: 

X 1 . = B/2^ og Y B / x i)i 


Intuitively, X[ is the smallest value allowed by Assumption IE. II that is at least as large as Xj. We 
say that X[ is accepted if and only if Xi is. One can verify that the following holds: 

• The values X[,X ' 2 ,..., X' T define a legal process with the same parameters p, B and L as the 
original process, and this process obeys Assumption lE.il Let Ti' a be the sum of the accepted 
values in this process. 

• For every value Xj, X- < 2X t , hence, Ti' a < 2 • 

Thus, by Lemma lE.111 


PrpA < pL/2 — 2] < Pr[E^ < pL — 22] < 


pB(L + 2 B) 

W 2 


pB(L + 2 B) 
42 2 


□ 


E.2 Proof for m* < f(OPT) /(256 (d + l) 2 ) 

In this section we prove that Algorithm [2] is 0(/3)-competitive when m* < f(OPT)/(256(d + l) 2 ). 
Recall that Algorithm [2] consists of two parts. Let us say that Algorithm [2] “applies the second 
option” when it executes the second part (the one that involves the aided algorithm). Notice that it 
is enough to show that the algorithm is 0(/3)-competitive when it applies the second option, since 
this option is applied with probability 1/2. 

We need some additional notation. First, we denote by 1 +1 the number of iterations performed 
by the loop on Line[6]of the algorithm (i.e., the 1 +1 iteration is the iteration at which the algorithm 
decides to leave the loop, and does not change A). Additionally, for every 1 < i < £, let A* and VFj 
denote the set A and the value W, respectively, immediately after the z-th iteration of this loop. 
For consistency, we also denote by Aq and Wq these set and value before the first iteration. Finally, 
for every 1 < i < £, let rij denote the element u chosen at iteration i of the loop. We begin the 
analysis of the small m* case by proving an upper bound on Wp (the final value of W). 
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Observation E.13. When Algorithm [21 applies the second option, Ai € X for every 0 < i < £. 

Proof. The observation holds since Algorithm O chooses at every iteration an element u and a set 
V u whose addition to A does not violate independence. □ 

Lemma E.14. When Algorithm [2| applies the second option, Wg < f(OPT). 

Proof. We prove by induction on i the claim that the inequality Wj < f(Af) holds for every 
0 < i < l. Notice that the observation follows from this claim since, by Observation IE. 131 Ag is 
independent, and thus, f(Ag) < f(OPT). 

For i = 0 the claim is trivial since Wq = 0 = /(0) = f(Ao). For i > 0, assume the claim holds 
for i — 1, and let us prove it for i. 


Wi = Wi_i + f( Ui | Ai-x U V Ui ) < /(A-t) + f(ui 
< /(Aj_i U P Ui ) + f(ui | Aj_i UD„,) = f(Ai) 


Ai_ iUPJ 


where the first inequality holds by the induction hypothesis, and the second inequality follows from 
the monotonicity of /. □ 


Our next objective is to prove a lower bound on Wg that holds with a constant probability. 
For that purpose, let us consider an offline algorithm which calculates a value W having the same 
distribution as the value W calculated by Algorithm [2J 


Algorithm 7: Offline W Calculation 


1 Let A <— 0, W <— 0 and T A f . 

2 while there exist u € T \ A and T> u C T> + (u) s.t. A U T> u + u G X do 

3 

4 

5 

6 


Find such a pair maximizing f(u \ A U X\t). 
with probability (d + 2)~ 1 do 

Increase W <— W + f(u \ A U V u ). 
Update A <— A U V u + u. 

otherwise Update T T — u. 


Observation E.15. The distribution of the value W calculated by Algorithm^ is identical to the 
distribution ofWg as calculated by Algorithm^ when it applies the second option. 

Proof. The loop starting on Line [6] of Algorithm [2] looks in each iteration for a pair of an element u 
and a set T> u maximizing f(u \ AuT> u ) and obeying some other conditions, including the requirement 
that u belongs to a set T containing every element with probability (d + 2) _1 , independently. 

On the other hand, Algorithm 0 has a loop that looks for a pair of an element u and a set T> u 
maximizing f(u \ A U T> u ) and obeying the same conditions, except for requiring u to belong to T. 
Once such a pair is found, the algorithm makes a random decision whether to keep u in T, and 
then uses the pair to increase the solution A if and only if it decides to keep u in T. 

Notice that the only difference between these two procedures is the point when the algorithm 
decides whether each element u should belong to T. Algorithm [2] makes the decisions for all elements 
at the beginning, while Algorithm [3] makes the decisions only when necessary. However, regardless 
of when the membership of elements in T is decided, the set of pairs used to increase the solution 
A is the same given that the same random decisions are made by both algorithms. □ 

To analyze the distribution of the value W calculated by Algorithm [3] we need some additional 
notation. Let £+ 1 be the number of iterations performed by the loop of Algorithm [3] (i. e., t is the 
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number of times elements are added to the set A), and for every 1 < i <1 let A % . T % and W denote 
the sets A and T and the value W, respectively, immediately after the i-th iteration of this loop. 
For consistency, we also denote by Aq, Tq and Wo these sets and value before the first iteration. 
Additionally, for every 1 < i < £, let Ui denote the element u chosen at iteration i of the loop. 
Finally, for every 0 < i < £, let us denote by OPTi the maximum value independent set that can 
be obtained from Ai by adding only T elements. Formally, 

OPTi = argmax f(S) . 

Sei\AiCScAiLifi 


Observe that OPTi is well defined since the set A, is independent for every 0 < i < £. 

Next, observe that Algorithm [3] can be viewed as a process of the kind described in Section rE.il 
More precisely, each iteration i of Algorithm [3] corresponds to one iteration of the process, the 
value of this iteration is f(v,i \ A t _\ U T>u i ) and this value is accepted if and only if iii is kept in T 
(and f(ui | A,_i U ) is added to W ). Note that, as required by the process definition, the value 
f(ui | Aj_i U T >{ h ) depends only on the acceptances of previous values, and the acceptances of the 
value f(ui | Aj_i U T>uf) is independent of anything else. Taking this point of view, Wt is exactly 
the sum of the accepted values, and thus, can be analyzed using Corollary IE. 121 To use the last 
corollary, we need to determine the parameters of the process: 

• By definition, m* upper bounds f(ui | (A*_i U T >{ H ) fl V + (u )) > f(ui | Aj_] U Vf H ). Hence, 
one can choose B = f(OPT)/[256(d + l) 2 ] > m* for the process given by Algorithm O 

• Every value is accepted with probability (d + 2)^ x , thus, this is the value of p. 

We are left to determine a possible value for the parameter L. For that purpose we need a few 
claims. 

Observation E.16. OPT ^ = A- ( . 

Proof. By definition OPT'j> contains the elements of A^ and (possibly) additional elements of Tg 
that do not violate independence when added to At. However, the fact that Algorithm [3] stopped 
during the l+l iteration implies that no elements of T) can be added to A^ without violating 
independence. Therefore, OPTj, cannot contain any elements beside the elements of At. □ 

For every 0 < i < t, let Li be the sum of the first i values of the process corresponding to 
Algorithm [31 Formally, 

i 

Li = Y. I Ai-i u Vtij) ■ 

3 =1 

Lemma E.17. For every 0 < i < £, /(A*) < (d + 1) • Lj. 

Proof. We prove by induction on i that for every 0 < i < £: 

f(Ai) < (d + 1) • ^ f(uj | Aj-i u p fl .) . (l) 

Uj €.Ai 

Observe that the lemma follows from the last claim since Uj can belong to the set Ai only when 
j < i. For i = 0, Equation (jTD is trivial since f(A 0 ) = f(0) = 0. Next, assume Equation ([T]) holds 
for i — 1 > 0, and let us prove it for i. If Hi is removed from T then this is true since in this case 
Ai = A ? ;_i. Thus, we can safely assume in the rest of the proof that Ui remains in T. 

For every 0 < i < £, let Ni = {v,j 0 A, | 1 < j < i}. Order the elements of Aj \ A,_i in an 
arbitrary order, and let Vj denote the j- th element in this order. Consider some 1 < j < |Aj\ Aj_i|, 
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if Vj 0 iVj_i then the pair of the element Vj and the set {vi,v 2 , ■ ■ ■, 1 } 0 V + (vj ) form a possible 

pair that Algorithm [3] could select on iteration i since Vj € J\f\ (IVj_ 1 U A*_i) = Tj_i \ Aj_i. Hence, 

f(iu | Aj_i UPflJ > f(vj | U ({ui,u 2 ,... nP + (uj))) 

> f(Vj | U {fi, u 2 ,..., u*_i}) . 

On the other hand, if Vj € JVj_i, then there must be some 1 < /i < * such that Vj = u^, and thus, 
Vj E T/j. By a similar argument to the one used above we get: 

f(u h | A h _lUDiJ > f (vj | i ft _i U U {«i, r 2 , • • •, «i-i}) n H + («j))) 

> f(vj | A h _i U (Aj_i U {ui, u 2 ,..., u*_i})) 

= f(vj | Aj_i U {ui, v 2 , • • •, . 

Adding the inequalities we got for every 1 < j < |A* \ Aj_i| gives: 

|Ai\Ai_i[ 

f(vj\A i - 1 U{v 1 ,v 2 ,...,Vi-i}) 

3 =1 

= X! /(^j I A-i u {ui,u 2 ,... f(vj I Aj_i U {ui,u 2 ,... ,Uj-i}) 

u ;) -eA i \(A i _iUiv i _i) uj-eAinJVi-i 

< |Ai \ (Aj_ 1 U fVj_i)| • /(uj | 1 U P fii ) + f(u h | A h _i U T>u h ) . 

UhGAinATi-i 

Observe that |A* \ (Aj_i U JV*_ 1 )| < d + 1 and Ai O iVj_i C i ; \ Aj_i — Ui . Combining both 

observations with the last inequality yields: 

f(Af) < f(Ai_i) + (d + 1) ■ f(v,i | Aj_i U 'Duj) + ^ f(uh | -A/i-i U P« h ) 

< f(Ai-i) + (d + 1) • ^ /(4 | Afc-i UPiJ < (rf + 1) ■ /(“h I ^ft-i ^ ^ 4 ) > 

where the last inequality holds by the induction hypothesis. □ 

Lemma E.18. For every 0 < i < 1, f(OPT L ) + (d + 1) ■ Lj > f(OPT). 

Proof. We prove the lemma by induction on i. For i = 0 the lemma is trivial since /( OPTq ) = 
f(OPT). Next, assume that the lemma holds for i — 1 > 0, and let us prove it for i. There are 
two cases to consider. First, let us consider the case where Ui is removed from T and A, = A,_i. 
In this case one potential candidate for OPTj is OPT ;-1 — Ui. If U{ OPTi- 1 , then we get 
f(OPTi_ 1 — Ui) = f(OPTi_ 1 ). On the other hand, if Ui € OPT t -\ then the pair of the element Ui 
and the set (OPTi-i \ 1 ) n P + (tq) is a possible pair that Algorithm [3] could select on iteration 

i. Hence, 


f (OPTi-i 


Ui) = f {OPTi_i) - f(ui 

> f (OPT^) - f(ui 

> f (OPTi^) - f(ui 


Ai- 1 U (OPTi-i \ Ai- 1 - v,i)) 

A-i-i U ((OPTi-i \ Ai-i) n V + (ui))) 
A -lUPflJ . 


25 


Therefore, regardless of the membership of u t in OPT)_i, we can lower bound by /(OPT)_i — uf) 
by /(OPT)_i) — f{ui | i U T>uf)- Using the induction hypothesis, we now get: 

fiOPTi) + {d+l)-L i > fiOPTi _! - + (d + 1) ■ + (d + 1) ■ f(ui | ii_i U 25*) 

> /(OPTi-O + (d + 1) ■ > /(OPT) . 


Next, let us consider the case where Hi is kept in T and A{ = U "D* + iij. In this case, by 
standard matroid properties, one can obtain a candidate for OPT; by starting with OPT,-lUT^+tq 
and removing from it a subset A C OPT;_i \ A{ of up to d + 1 elements. Let us denote by OPT/ 
this candidate, i.e., OPT' = (OPT;_i U "D* + uf) \ A. 

Order the elements of A in an arbitrary order, and let Vj denote the j-th element in this order. 
Observe that by monotonicity: 


|A| 

f(OPT ,;_!) - /(OPT/) < f(OPTi-i) - /(OPTj_! \ A) = £ f( Vj \ OPT, i_ x \ {«,, i*,... , Vj }) 

3 = 1 


|A| 

< X! f ( v 3 I u ((OPTj_i \ {ui,u 2 ,... ,Uj}) n P + (uj))) < (d+ 1) • f(ui \ Aj_i UT> + (tq)) . 

i=i 


where the last inequality holds since Vj E OPT;_i \ A; C T)_i for every 1 < j < |A|, and thus, the 
pair of the element vj and the set [(OPp-i \ {ui, u 2 ,. .., Vj}) \ A,_i] 0 T ,+ (u :/ ) is a possible pair 
that Algorithm [3] could select on iteration i. Using the induction hypothesis, we now get: 

/(OPT,) + (d + 1) ■ Lj > /(OPT/) + (d + 1) ■ Li_i + (d + 1) ■ f(ui | A_i U 2?*) 

> f(OPTi-i) + (d + 1) ■ Tj-i > /(OPT) . □ 


Corollary E.19. The parameter L of the process corresponding to Algorithm 0 can he chosen to 
be f(OPT)/[2(d + 1)]. 

Proof. Observe that any value that always lower bounds Ly can be used as a value for the parameter 
L. Combining Lemmata lET7l and lE. 181 gives: 

2(d + 1) • L t > f{A t ) + [/(OPT) - /(OPT,-)] = /(OPT) , 

where the equality holds by Observation IE.161 □ 


Now that we have all the parameters of the process corresponding to Algorithm [3l we can use 
Corollary IE. 121 to give a guarantee on It/. 

Lemma E.20. When Algorithm^ applies the second option and m* < /(OPT)/[256(d+l) 2 ], then, 
with probability at least 7/8, H/ > . 

Proof. Recall that Ws is the sum of the accepted values in the process corresponding to Algorithm^ 
Hence, by Corollary IE. 121 


Pr 


w t < 


/(OPT) 1 

8(d + 2) 2 . 


< Pr 


>h<f 


pL 1 ^ pB(L + 2 B) 
Tj - 4(pL/4) 2 


4B(L + 2B) 


A .f(OPT) ( f(OPT) , 2 -f(OPT) \ 
* 2560 +W V 2(d+l) _r 256(d+l) 2 / 

1 . ( f{OPT) \ 2 

d -\-2 \ 2(d+l) y 


4_±_ T . _L_ 

2560+10 d+1 

~T~ 1 

d+ 2 ’ 4(d+l) 2 


+ 2 ) < 1 
16(d+ 1) “ 8 


The lemma now follows since It/ and It/ have the same distribution by Observation IE. 151 □ 


26 































To complete the analysis of Algorithm [2] we also need the following notation and lemma. Given 
a set S C AA, let S(p) be a random set containing every element u E S, independently, with 
probability p. 

Lemma E.21. For every set S C M, E[/(5(p))] > p d+1 ■ f(S). 

Proof. For every element u E S , let X u be an indicator for the event that S H V + {u) + u E S(p). 
Clearly, Pr[A u = 1] > p d+1 . Also, let U\,U 2 , • • • , U| 5 | denote an arbitrary order of the elements of 
S. Then, 

\S\ 

Hf( s (p))] = n S(p) | S(p) n {«i,« 2 , • ■ -,«i-i})] 

i= 1 

\S\ 

> • f{m I {ui,u 2 , ■ ■ ■ ,Ui_i})] 

i= 1 

|S| 

>p d+1 -J2f(ui I =p d+l ■ f{S) , 

2—1 


where the first inequality follows from the definition of T> + {u). □ 

Lemma E.22. If m* < f(OPT)/[256(d+ l) 2 ] , then Algorithm^ is 0(/3)-competitive. 

Proof. Throughout this proof we assume that Algorithm [2] applies the second option. This event 
happens with probability 1/2, thus, it is enough to prove that Algorithm [2] is 0(/3)-competitive 
given this event. 

Let R = Af\ Ti be the set of elements that are not observed by Algorithm [21 By Lemma lE.211 


E [f(OPTHR)\ = E 


/ OPT 1 - 


d + 2 


> 1 - 


J-) 

d + 2 / 


d +1 


f(OPT) > e” 1 • f{OPT) . 


Hence, 

E [f(OPT{R))\ > E [f(OPT n R)] > ^°^ . 

where OPT(R) is the independent subset of R maximizing /. Since f(OPT(R )) is always upper 
bounded by /(OPT), this implies the following claim: 


Pr 


f(OPT(R)) 


f(OPTy 

10 



On the other hand, by Lemma lE. 201 Wp > f(OPT)/[8(d + 2) 2 ] with probability at least 7/8. 
Hence, by the union bound we have: 


Wp> 


f(OPT) 
8(d + 2) 2 


and f(OPT(R)) > f( ° P J ) 


with probability at least 1/24. To complete the proof it is enough to show that ALG is 0(/3)- 
competitive when the last two inequalities hold. This follows from the definition of ALG when 
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°pt 8 o(rf-(- 2 ) 2 is a valid approximation for f(OPT(R)), i.e., when we have f(OPT(R))/[80(d + 2) 2 ] < 
°P t 80 (d+ 2 ) 2 — f{OPT{R)). Thus, in the rest of the proof we prove these inequalities: 

OPt80( rf+ 2) 2 = ^ < / (OPT(R)) , 

where the first inequality follows from Lemma lE. 141 On the other hand, 

f(OPT(R )) < f{OPT) < 8(d + 2) 2 Wj? = 80(d + 2) • opt 80(d+2)2 . □ 

Note that Theorem O follows immediately from Lemmata 1C.II and IE.221 
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